classifier 0
Study of Robust Features in Formulating Guidance for Heuristic Algorithms for Solving the Vehicle Routing Problem
Herdianto, Bachtiar, Billot, Romain, Lucas, Flavien, Sevaux, Marc
Combinatorial optimization problems, such as Vehicle Routing Problems (VRP), are important in real-world applications as they search for efficient solutions to minimize costs. Despite extensive research over decades, achieving optimal solutions remains a challenge (Laporte, 2009). Furthermore, the unique constraints of various problem variants demand specialized algorithms. The development of these algorithms is complex, making Machine Learning (ML) an attractive approach to improving the existing algorithms. Routing algorithms are typically divided into two categories: exact algorithms that offer global optimum but require many computational resources and heuristics methods for practical, real-world applications that mostly find a near-optimal solution. While most heuristics rely on human-designed strategies (Lucas et al., 2020), ML offers a new approach improving algorithm. Moreover, the selection of features influenced by these ML models plays a critical role in effectively enhancing heuristic performances (Arnold and S orensen, 2019b; Arnold and S orensen, 2019a; Lucas, Billot, and Sevaux, 2019). Understanding the predictions of an ML model can be as crucial as the accuracy of the prediction itself in many applications (Lundberg and Lee, 2017).
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.92)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Time-EAPCR-T: A Universal Deep Learning Approach for Anomaly Detection in Industrial Equipment
Liang, Huajie, Wang, Di, Lu, Yuchao, Song, Mengke, Liu, Lei, An, Ling, Liang, Ying, Ma, Xingjie, Zhang, Zhenyu, Zhou, Chichun
With the advancement of Industry 4.0, intelligent manufacturing extensively employs sensors for real-time multidimensional data collection, playing a crucial role in equipment monitoring, process optimisation, and efficiency enhancement. Industrial data exhibit characteristics such as multi-source heterogeneity, nonlinearity, strong coupling, and temporal interactions, while also being affected by noise interference. These complexities make it challenging for traditional anomaly detection methods to extract key features, impacting detection accuracy and stability. Traditional machine learning approaches often struggle with such complex data due to limitations in processing capacity and generalisation ability, making them inadequate for practical applications. While deep learning feature extraction modules have demonstrated remarkable performance in image and text processing, they remain ineffective when applied to multi-source heterogeneous industrial data lacking explicit correlations. Moreover, existing multi-source heterogeneous data processing techniques still rely on dimensionality reduction and feature selection, which can lead to information loss and difficulty in capturing high-order interactions. To address these challenges, this study applies the EAPCR and Time-EAPCR models proposed in previous research and introduces a new model, Time-EAPCR-T, where Transformer replaces the LSTM module in the time-series processing component of Time-EAPCR. This modification effectively addresses multi-source data heterogeneity, facilitates efficient multi-source feature fusion, and enhances the temporal feature extraction capabilities of multi-source industrial data.Experimental results demonstrate that the proposed method outperforms existing approaches across four industrial datasets, highlighting its broad application potential.
- Machinery > Industrial Machinery (0.41)
- Information Technology (0.34)
OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models
Ray, Lala Shakti Swarup, Zhou, Bo, Suh, Sungho, Lukowicz, Paul
Understanding human-to-human interactions, especially in contexts like public security surveillance, is critical for monitoring and maintaining safety. Traditional activity recognition systems are limited by fixed vocabularies, predefined labels, and rigid interaction categories that often rely on choreographed videos and overlook concurrent interactive groups. These limitations make such systems less adaptable to real-world scenarios, where interactions are diverse and unpredictable. In this paper, we propose an open vocabulary human-to-human interaction recognition (OV-HHIR) framework that leverages large language models to generate open-ended textual descriptions of both seen and unseen human interactions in open-world settings without being confined to a fixed vocabulary. Additionally, we create a comprehensive, large-scale human-to-human interaction dataset by standardizing and combining existing public human interaction datasets into a unified benchmark. Extensive experiments demonstrate that our method outperforms traditional fixed-vocabulary classification systems and existing cross-modal language models for video understanding, setting the stage for more intelligent and adaptable visual understanding systems in surveillance and beyond.
- Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.05)
- North America > United States > Oregon > Multnomah County > Portland (0.04)
- Asia > China > Hunan Province > Changsha (0.04)
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Li, Tianyi, Dayanik, Erenay, Tyagi, Shubhi, Pierleoni, Andrea
In this paper, we present HalluCana, a canary lookahead to detect and correct factuality hallucinations of Large Language Models (LLMs) in long-form generation. HalluCana detects and intervenes as soon as traces of hallucination emerge, during and even before generation. To support timely detection, we exploit the internal factuality representation in the LLM hidden space, where we investigate various proxies to the LLMs' factuality self-assessment, and discuss its relation to the models' context familiarity from their pre-training. On biography generation, our method improves generation quality by up to 2.5x, while consuming over 6 times less compute.
- North America > United States > Washington > King County > Seattle (0.14)
- Europe > France (0.05)
- Asia > Singapore (0.04)
- (11 more...)
- Research Report (0.50)
- Personal (0.46)
- Leisure & Entertainment (1.00)
- Media > Music (0.93)
- Government > Regional Government (0.67)
Kolmogorov Arnold Networks in Fraud Detection: Bridging the Gap Between Theory and Practice
This study evaluates the applicability of Kolmogorov-Arnold Networks (KAN) in fraud detection, finding that their effectiveness is context-dependent. We propose a quick decision rule using Principal Component Analysis (PCA) to assess the suitability of KAN: if data can be effectively separated in two dimensions using splines, KAN may outperform traditional models; otherwise, other methods could be more appropriate. We also introduce a heuristic approach to hyperparameter tuning, significantly reducing computational costs. These findings suggest that while KAN has potential, its use should be guided by data-specific assessments.
- North America > United States > New York (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Law Enforcement & Public Safety > Fraud (0.88)
- Banking & Finance (0.68)
Detection of Malaria Vector Breeding Habitats using Topographic Models
Treatment of stagnant water bodies that act as a breeding site for malarial vectors is a fundamental step in most malaria elimination campaigns. However, identification of such water bodies over large areas is expensive, labour-intensive and time-consuming and hence, challenging in countries with limited resources. Practical models that can efficiently locate water bodies can target the limited resources by greatly reducing the area that needs to be scanned by the field workers. To this end, we propose a practical topographic model based on easily available, global, high-resolution DEM data to predict locations of potential vector-breeding water sites. We surveyed the Obuasi region of Ghana to assess the impact of various topographic features on different types of water bodies and uncover the features that significantly influence the formation of aquatic habitats. We further evaluate the effectiveness of multiple models. Our best model significantly outperforms earlier attempts that employ topographic variables for detection of small water sites, even the ones that utilize additional satellite imagery data and demonstrates robustness across different settings.
- North America > United States (0.29)
- Africa > Ghana (0.25)
- Africa > Kenya > Western Province (0.05)
- (3 more...)
The MRI Scanner as a Diagnostic: Image-less Active Sampling
Du, Yuning, Dharmakumar, Rohan, Tsaftaris, Sotirios A.
Despite the high diagnostic accuracy of Magnetic Resonance Imaging (MRI), using MRI as a Point-of-Care (POC) disease identification tool poses significant accessibility challenges due to the use of high magnetic field strength and lengthy acquisition times. We ask a simple question: Can we dynamically optimise acquired samples, at the patient level, according to an (automated) downstream decision task, while discounting image reconstruction? We propose an ML-based framework that learns an active sampling strategy, via reinforcement learning, at a patient-level to directly infer disease from undersampled k-space. We validate our approach by inferring Meniscus Tear in undersampled knee MRI data, where we achieve diagnostic performance comparable with ML-based diagnosis, using fully sampled k-space data. We analyse task-specific sampling policies, showcasing the adaptability of our active sampling approach. The introduced frugal sampling strategies have the potential to reduce high field strength requirements that in turn strengthen the viability of MRI-based POC disease identification and associated preliminary screening tools.
- South America > Peru > Lima Department > Lima Province > Lima (0.04)
- Europe > United Kingdom (0.04)
- North America > United States > Indiana > Marion County > Indianapolis (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.54)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Exploring the Determinants of Pedestrian Crash Severity Using an AutoML Approach
Rafe, Amir, Singleton, Patrick A.
This study investigates pedestrian crash severity through Automated Machine Learning (AutoML), offering a streamlined and accessible method for analyzing critical factors. Utilizing a detailed dataset from Utah spanning 2010-2021, the research employs AutoML to assess the effects of various explanatory variables on crash outcomes. The study incorporates SHAP (SHapley Additive exPlanations) to interpret the contributions of individual features in the predictive model, enhancing the understanding of influential factors such as lighting conditions, road type, and weather on pedestrian crash severity. Emphasizing the efficiency and democratization of data-driven methodologies, the paper discusses the benefits of using AutoML in traffic safety analysis. This integration of AutoML with SHAP analysis not only bolsters predictive accuracy but also improves interpretability, offering critical insights into effective pedestrian safety measures. The findings highlight the potential of this approach in advancing the analysis of pedestrian crash severity.
- North America > United States > Utah > Cache County > Logan (0.04)
- South America > Colombia (0.04)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- Europe > United Kingdom (0.04)
- Transportation > Ground > Road (1.00)
- Health & Medicine (0.93)
Predictive Maintenance of Armoured Vehicles using Machine Learning Approaches
Sengupta, Prajit, Mehta, Anant, Rana, Prashant Singh
Armoured vehicles are specialized and complex pieces of machinery designed to operate in high-stress environments, often in combat or tactical situations. This study proposes a predictive maintenance-based ensemble system that aids in predicting potential maintenance needs based on sensor data collected from these vehicles. The proposed model's architecture involves various models such as Light Gradient Boosting, Random Forest, Decision Tree, Extra Tree Classifier and Gradient Boosting to predict the maintenance requirements of the vehicles accurately. In addition, K-fold cross validation, along with TOPSIS analysis, is employed to evaluate the proposed ensemble model's stability. The results indicate that the proposed system achieves an accuracy of 98.93%, precision of 99.80% and recall of 99.03%. The algorithm can effectively predict maintenance needs, thereby reducing vehicle downtime and improving operational efficiency. Through comparisons between various algorithms and the suggested ensemble, this study highlights the potential of machine learning-based predictive maintenance solutions.
- Asia > India (0.05)
- North America > United States > Texas > Kleberg County (0.04)
- North America > United States > Texas > Chambers County (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Government > Military (1.00)
- Health & Medicine (0.89)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.92)
- (2 more...)
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans
Mehta, Anant, Sengupta, Prajit, Garg, Divisha, Singh, Harpreet, Diamand, Yosi Shacham
Plant breeders and agricultural researchers can increase crop productivity by identifying desirable features, disease resistance, and nutritional content by analysing the Dry Bean dataset. This study analyses and compares different Support Vector Machine (SVM) classification algorithms, namely linear, polynomial, and radial basis function (RBF), along with other popular classification algorithms. The analysis is performed on the Dry Bean Dataset, with PCA (Principal Component Analysis) conducted as a preprocessing step for dimensionality reduction. The primary evaluation metric used is accuracy, and the RBF SVM kernel algorithm achieves the highest Accuracy of 93.34%, Precision of 92.61%, Recall of 92.35% and F1 Score as 91.40%. Along with adept visualization and empirical analysis, this study offers valuable guidance by emphasizing the importance of considering different SVM algorithms for complex and non-linear structured datasets.
- Asia > India (0.05)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
- Oceania > New Zealand > North Island > Waikato (0.04)
- North America > United States (0.04)